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ABSTRACT 

We applied the signed distance function (SDF) for 
representing the depths of atoms in a macromol- 
ecule. The calculations of SDF values were per- 
formed on grid points in a rectangular box that 
accommodates the macromolecule. The depth for 
an atom inside the molecule was then obtained as 
a result of tri-linear interpolation of SDF values at 
the nearest grid points surrounding the atom. For 
testing the performance of present program 
Adepth, we have constructed an artificial molecule 
whose atomic depths are known as the gold 
standard for accuracy assessments. On average, 
our results showed that Adepth reached an 
accuracy of 1.6% at 0.5 A of grid spacing, whereas 
the current reference server DEPTH reached 7.5%. 
The Adepth program provides both depth and height 
representations; it is capable of computing iso- 
surfaces for atomic depths and presenting graphical 
view of macromolecular shape at some distance 
away from the surface. Web interface is available 
at http://biodev.cea.fr/adepth. 



INTRODUCTION 

The atomic depth has been found extremely useful in 
studying hydrogen/deuterium exchange rates (1), the char- 
acterization of protein domain size and identification of 
potential targets for phosphorylation (2), quantifications 
of driving force for a burial residue (3), protein fold rec- 
ognition (4) and predictions of small-molecule binding 
cavities in proteins (5,6). As a closely related quantity, 
residue depth was used to study mutational impacts on 
protein stability (7) and the relationship of depth 
measure with protein sequences (8,9). 

In the late 70', macromolecular shape was described by 
a map of solvent exclusion, a method that could claim the 
first computation of atomic depths (10). Since then several 
methods have been devised to compute atomic depths in 



proteins. The first one used a pre-equilibrated solvent-bath 
in which the macromolecule of interest is immerged. 
Water molecules that bump with macromolecular atoms 
are removed from the bath, and the depth is assigned as a 
value of the distance between any atom and the nearest 
water molecule on the surface (7). Several other methods 
require computations of ASA to determine which atoms 
are buried beforehand (11). The depth of a buried atom is 
then measured as the distance between the atom and its 
closest solvent-accessible atom (2) or the closest vertex of 
ASA surface (8,9). Alternatively, one may attribute the 
depth index to each atom in the macromolecule by 
measuring the solvent-exposed volume of a virtual 
sphere with a varying radius located at the atomic centre 
(12). 

In this work, we present an unsupervised program, 
Adepth, which computes the atomic depth without 
requiring previous knowledge of ASA or preparations of 
a solvent bath. Another advantage of Adepth relies in its 
integrity on identification of surface atoms and quantifi- 
cation of atomic depths in one run. Validation of the 
program was performed on a well-designed molecular 
structure with known depth information (Figure 1). 
Applications of Adepth includes definition of protein 
skins with a user-defined thickness, calculations of 
atom-atom overlaps in macromolecular assemblies (14) 
and a graphical visualization of depth using iso-surfaces 
in VMD (13). Another useful application of Adepth 
concerns the assessment of tip-convolution effects in 
atomic force microscopy (AFM) imaging (15). In the 
range of grid spacing 0.5-1 A, Adepth has been shown 
to be more accurate than available servers that were 
tested in the present study. 

IMPLEMENTATION 

In the earlier work (16), the signed distance function 
(SDF) field of a protein allowed us to identify solvent 
accessible atoms and determine how deep a residue is 
inside the protein. In summary, we placed the protein in 
a rectangular grid of which width, length and height were 
determined by the size of the protein. The grid was 
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Figure 1. Representation of the 3D structure of the standard molecule 
drawn by the vdw method in VMD (13). Each atom was scaled down 
to 20% of its original size. The Cartesian co-ordinates of atoms in the 
molecule can be expressed as (Rcos<psin9, RsinipsinO, RcosO), where R 
is the radius of sphere, 0 < 9 < k and 0 < ip < 2n; totally there are 25 
sampling points along the 6 coordinate and 50 for the (p coordinate. 
Atomic depth is colour-coded as follows: cyan = OA, green = 3 A, 
orange = 6 A, yellow = 9 A and red = 12 A. 



uniformly discretized in three dimensions at user-defined 
grid spacing {Ax, Ay and Az). For each atom of the 
protein, the atomic van der Waals (vdw) radius was 
given based on the atomic type defined in the 
Charmm22 force field (17), as shown in Table 1. By 
default, the radius of solvent or probe was set to 1.5 A. 
For representing the solvent-accessible surface of the 
molecule, an extended radius, the sum of vdw and 
solvent radii, was assigned to each heavy atom. 
Following level set methods (18), we first computed the 
SDF values at interfacial grid points (level 0) using infor- 
mation of where are limits of atomic and solvent force 
fields. For a grid point (i j k) at next level, the SDF 
value, D s (i j k), was obtained by solving the quadratic 
equation (19) 

[D s (ijk) - Dyf + [D s (ijk) - D 2 f + [D s (ijk) - D 3 f = i 

Ax 2 Ay 2 Az 2 

where 

D y =mm[D s (i- \jk),D s {i+\jk)\, 

D 2 = mm[D s {ij- 1 k),D s (ij+l k)], 

D 3 = mm[D s (ij k-l), D s (ij k+l)] 

The computations were performed level by level until no 
un-visited grid point was left. The atomic depth was then 
obtained as tri-linear interpolation of SDFs at the nearest 
grid points surrounding the atomic centre. We denoted 
this strategy as Adepth. 



Table 1. Vdw radii used for atoms in Adepth 



Atomic type A 



H (polar) 0.225 

H (non-polar) 1.320 

C 2.175 

N 1.850 

O 1.800 

S 2.200 

P 2.150 



The performance of Adepth depends on parameters, 
such as grid spacing, probe radius and atomic vdw radii, 
used. The finer grid spacing the more accurate results, we 
found that grid spacing of 0.5 A is a good compromise 
between computational cost and accuracy (Figure 2A 
and B). Accounting for atomic size effects on atom 
depths, we scaled the SDF magnitude by the atomic 
extended radius. In the output of PDB format, the 
atomic depths and scaled ones are presented in B-factor 
and occupancy columns, respectively. In the web server, 
an option allows the user to obtain the output of DX 
format so that one can display 3D-field data in the iso- 
surface form using VMD (13). Moreover, the user can also 
request an extruded slice of a macromolecule with pre- 
defined thickness as the surface skin of the macromolecule 
(14). 

VALIDATION 

As there is no analytical solution to quantification of 
atomic depths for the 3D structure of a protein or a 
nucleic acid, a novel method is usually validated by 
comparing with other available methods. A qualitative 
comparison between various methods on different macro- 
molecular architectures is presented in the Supplementary 
Figure SI: a nucleic acid [1BNA (20)], a fibrous protein 
collagen [1CGD (21)], a large human coagulation factor 
VIII [3CDZ (22)] and the 30S ribosome subunit [2WDK 
(23)]. However, the quantitative assessment of these 
methods remains elusive. A new approach for evaluating 
the performance of depth computation is presented. A 
pseudo-molecule was created as a standard structure 
that consists of one-atom (C 01 ) residues, namely, UNK 
(UNK means of unknown type; atomic co-ordinates of 
the gold-standard molecule in the PDB format are avail- 
able in the web server). The 3D structure of the standard 
molecule is formed of four equally spaced (3 A) concentric 
spheres as shown in Figure 1. For the present purpose, 
how dense the residues packed is not the matter, but 
how much the space filled up is concerned. Totally, there 
are 1201 residues (or atoms) in each spherical layer, and 
the outermost sphere has a radius of 12 A. 

As described in 'Implementation' section, probe radius 
and atomic radii are factors greatly influencing the results 
of atomic depth. We have performed calculations on the 
standard molecule with several values of grid spacing Ar 
(0.5-2.0 A) such that Ar = Ax = Ay = Az, as well as with 
different probe radii, 1.4-1. 6 A. Among available web 
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servers, only DEPTH server (6) could take in the standard 
molecule and go through the entire calculation process, 
whereas others such as the DPX servers (24) aborted the 
run because of failure of the associated surface program, 
whereas the SADIC (25) and PocketDepth servers (5) did 
not return any results. It should be noted that the inter- 
sphere distance is measured according to positions of 
atomic centres, whereas the depth of an atom depends on 
the distance between the atomic centre and the probe 
centre. Consequently, to make a direct comparison of 
depth computations and inter-sphere distances for the 
standard molecule, one has to subtract the extended 
radius of an atom from the depth value of that atom. To 
be explicit, the atomic depth was reduced by 2.175 plus 
1.5 A from Adepth and by 1.53 plus 1.5 A from the 
DEPTH server (the value of 1.53 A was obtained from 



A 2.0 



1.5 

o 
o 

E 1.0 
0. 

o 



0.5 



0.0 



■*- Ar = 0.5 A 
Ar = 0.6A 
• Ar = 0.8 A 
-B- Ar= 1.0 A 



0 2000 4000 6000 8000 

Natom 



B12.01 



•< 9.0 



5 5S 



a 5 a 



S 2 5 



n o a 



o. 

a> 

73 



jHjh ^ ? - !r ^ T ____. 

re 
*_ 
tt 

> V 

< 3.o--4-5-5-5-f-f-f-5-5- ? -5- 2 - ? - 5 - ? - ? 



0.0^ 



s o s <? o a xs a - o" c> o 



0.5 



1.0 



1.5 



2.0 



Ar(A) 



Figure 2. (A) Relationship of computational cost and molecular size. 
The CPU time was plotted against the number of atoms (N alom ) of the 
standard molecule for Ar = 0.5, 0.6, 0.8 and I.OA. The four sets of 
data values were fitted with linear regression, resulting in a slope of 0.2, 
0.1, 0.07 and 0.05 msec/atom for Ar = 0.5, 0.6, 0.8 and 1.0 A, respect- 
ively. (B) Validation of Adepth and DEPTH on atomic depth calcula- 
tions for the standard molecule using a probe radius of 1.5 A. In the 
plot, the horizontal dashed lines represent the distances of spherical 
layers from the surface. Data points with open symbols represent the 
values of averaged atomic depth versus Ar from Adepth, whereas that 
with filled symbols located along the vertical axis are referred to the 
DEPTH server. Bars around the open symbols represent the standard 
deviation; when it is <1%, bars are not drawn. 



the carbon-atom type Amber force field, the same force 
field used to equilibrate the solvent-bath in the DEPTH 
server). The results of Adepth and DEPTH are plotted in 
Figure 2B where the data of DEPTH were calculated using 
default parameters in the website. Adepth results for the 
probe radius of 1 .4 and 1 .6 A are not shown, as their values 
are close to that of 1.5 A. Each data point in the plot rep- 
resents an averaged depth over all atoms in a spherical 
layer. Despite deviations from true depths become 
evident at large grid spacing, the order of depth ranking 
from the Adepth results remains unchanged (Figure 2B). 

The computational accuracy of atomic depth is defined 
as a ratio of measured depth to the true depth. The 
measured depth was obtained as data values of averaged 
depth shown in Figure 2B. The dependence of computa- 
tional accuracy on grid spacing is presented in Table 2. 
Comparing the depth accuracy, we found that globally 
Adepth is more accurate than the o DEPTH server at 
small to medium grid spacing (~1A). Nevertheless, for 
the deepest layer, Adepth can obtain better results that 
DEPTH at fine grid spacing, i.e. 0.1 A, which demands 
high CPU cost and huge memory load. 

DISCUSSION 

The relationship between protein interior and atomic 
depth has been extensively discussed (2,7,25). New impli- 
cations of atomic depths calculated by the Adepth server 
will be illustrated later in the text. 

Once atomic depths for a macromolecule are calculated, 
a built-in option in the Adepth server allows the user to 
obtain a molecular slice based on values of atomic 
depths. The corresponding outputs are called 'extruded 
surfaces' that are useful in molecular docking and 
assembly. It is usually done through an exhaustive compu- 
tation of inter-atomic distance for detecting atomic 
overlaps in a complex. In practice, this is not necessary 
because the desired information is to find bumping atoms 
at the interface (14). Guided by an extruded surface of a 
molecular complex, one may reduce the number of atoms in 
distance calculations by ~4-10 times while maintaining an 
accurate contour for the complex molecule (Figure 3A). 

Another novel feature of Adepth allows the user to visu- 
alize 3D iso-surface mapping of atomic depths. As an SDF 
value carries either sign, it is now possible to accurately 
display an expanded volume of a given macromolecule. 



Table 2. Comparison of computational accuracy between DEPTH 
and Adepth" 



Layer 


True 


DEPTH 


Ar = 0.5 


Ar = 1.0 


Ar = 2.0 




depth 


(%) 


(%) 


(%) 


(%) 


1 


3 


17.3 


2.6 


1.1 


S.5 


2 


6 


7.4 


1.5 


1.0 


6.8 


3 


9 


5.1 


0.2 


2.4 


9.1 


4 


12 


0.3 


2.2 


6.7 


13.9 



a The innermost sphere of the standard molecule is labelled 4 (red in 
Figure 1), and the layer enumeration is decreased by 1 for next 
outwards sphere; computations with different grid spacing were per- 
formed only by Adepth; true depth and Ar are in the unit of A. 
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Figure 3. Implications of new representation for atomic depths. (A) 
Cartoon representation of the extruded crystal structure of a Fab 
domain of antibody protein using aprobe radius of 1.5 A, Ar = 1 A, 
and depth selection threshold <3.5A. Of 3291 atoms, 805 are found 
within the given threshold. (B) Iso-surface representation for an 
expanded surface of an antibody molecule. The crystal structure of 
the antibody is represented by 1IGT (26) with vdw spheres and 
coloured atomic types: cyan for carbon, red for oxygen, blue for 
nitrogen and yellow for sulphur atoms. The atomic depths of 1IGT 
and the associated SDF values at grid points were computed using a 



This is convenient for examining nearby surfaces of a 
macromolecule while still tracking the distance from the 
surface atoms. Analogous results could be obtained by 
traditional computations of ASA; yet, it is not plausible 
for most surface programs because of the need of a huge 
probe for generating such a remote surface at 50 A away 
as shown in Figure 3B. This feature is useful in AFM for 
visualizing putative tip-convolution effects. As known, the 
tip effects lead to dilations of object boundary in AFM 
images (15,28). With a large radius of tip apex, the tip 
convolution effect greatly smoothed the macromolecular 
shape. When an expanded surface of a macromolecule can 
be displayed, it is easier to grasp an idea of what to expect 
from AFM imaging on a single molecule (Figure 3B). 
As tip convolution effect mostly occurred as a 2D artefact 
in an AFM image, one may scrutinize the strict 3D repre- 
sentation provided by Adepth in certain direction for 
viewing expected convoluted surfaces for the 
macromolecule. 

The final feature of Adepth is the use of 3D iso-poten- 
tial surface for constructing a graphic representation of 
protein cores. Atomic depth is particularly well adapted 
to the identification of protein core residues as illustrated 
in the ProCoCoa method (25). As previously suggested, 
the residue depth is an important parameter for interpret- 
ing mutational data in proteins (7) and correlates well with 
residue hydrophobicity (2,3). Iso-surfaces provide an in- 
tuitive interpretation of atomic depth. For instance, 
graphical representation of atomic depth was helpful in 
illustrating the puzzling clinical data regarding the 
impact of a single-nucleotide polymorphism on activities 
of activated coagulation factor VII (FVIIa). Two appar- 
ently similar missense mutations were found in unrelated 
patients having a gene defect in FVII: Gly343Ser and 
Gly391Ser; however, the phenotype of Gly343Ser was 
severe, whereas that of Gly391Ser was minor (29). Using 
the iso-surface representation of the ternary complex 
model (27), it can be observed that although both 
Gly343 and Gly391 are completely buried (Figure 3C), 
Gly343 is located deeper in the hydrophobic core than 
Gly391. Thus, the apparent paradox could be interpreted 
in term of depth rather than solvent-burial. Although this 
observation remains phenomenological, it conveys the 
already known idea that a deep mutation as a greater 
impact on protein structure than a solvent-exposed 
mutation (7). 



Figure 3. Continued^ 

probe radius of 1.5 A at Ar = 1 A. The iso-surface in orange wireframe 
was located 50 A away from the antibody surface. (C) Clipped core 
structure of the serine protease domain of FVIIa (27). The orange 
wireframe represents the iso-surface of the SDF value at a threshold 
of 7.5 A using VMD. The main-chain conformation is displayed in a 
ribbon of red-to-white-to-blue spectrum, where red and blue are for 
buried and solvent-exposed regions, respectively. The deepest atom in 
FVIIa is 13.5 A inwards from the protein surface. The mutated glycines 
with almost no solvent-accessible surface areas are represented by 
coloured CPK spheres. The residue depth was calculated as 10.0 A 
for Gly343 and 4.8 A for Gly391; thus, only Gly343 is enclosed in 
the orange wireframe. 
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SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Figure 1. 
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